CDS

Accession Number TCMCG075C26141
gbkey CDS
Protein Id XP_017983604.1
Location complement(join(5743353..5743445,5743744..5744595,5744705..5744785,5744870..5744956,5745071..5745180,5745437..5745539,5745717..5745893,5746129..5746204,5746401..5746485,5746558..5746752,5746945..5747197))
Gene LOC18588591
GeneID 18588591
Organism Theobroma cacao

Protein

Length 703aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018128115.1
Definition PREDICTED: uncharacterized protein LOC18588591 isoform X2 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category D
Description PP-loop family
KEGG_TC -
KEGG_Module -
KEGG_Reaction R09597        [VIEW IN KEGG]
KEGG_rclass RC02633        [VIEW IN KEGG]
RC02634        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko03016        [VIEW IN KEGG]
KEGG_ko ko:K04075        [VIEW IN KEGG]
EC 6.3.4.19        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGCGCGAGGCCTACTCCTTTGCTCACAAACCAGAACCATAGCCAAGCTTTCATTCTCACTCCCAAATCGCAAATTCCGGTTCAAGCCCCACTATCATCACCTTTACTATGAACATCAAAATCTTCCTTCAACGCGCGTCTTTTGCCACTGTGTTAGCTCCCAACCGTCAACCGTAGTAGATATGGCCAAGTACAACGAAGCTTTCTCCAGGCGAATGGCCATGGCCGGCCTCAAACCCCACAACCACATCGCTTTAGGAGTATCTGGGGGACCGGATAGCATGGCTTTGTGTGTTCTAGCAGCTAATTGGAAAACTGAAGGCCTATATGGCAGTGACAAGAGTGGAAATTGCATTGATGGTCTCTTGGCAATAATTGTTGATCATGGGCTACGTCCAGAGAGCAAAGATGAAGCAAGTTTAGTGGGACATCGGGTTGCAGAAATTGGAATCAGATTTGAGATTGCCCGTTGTGATTGGTCAAATGGCAAGCCAAAACAAGGTCATTTGCAAGAAGCTGCTCGTGACATGAGGTATAAAATATTTCAGGATGTTTGTATGCAAAACCAGATCAGTGTTTTACTTGTTGCACATCATGCAGATGACCAGGCTGAGTTATTCATTCTTAGATCATCTCGTGATAGTGGGGTCCTTGGACTTGCTGGCATGGCATTCACATCTCAAGTGTTCTCTTCACATACATATTTTAGTAACAAAGATTGGAAATGTCATAGCATTCTTCTAGTGCGGCCACTTCTGGATTTTTCAAAAGAAGACATGTACAAGATATGTCAAGGGAGTAACCATGATTGGGTTGAGGATCCAACAAATCGAAGTTCATTATTTGCTCGGAATAGGATTCGGATGTCACTGGGAAATTTGTCATCTTGTATCTTTAAGTCTGAACTACAAGCAGTTATTTCTGCCTGTCGTAAAACACGCACCTATGTTGATCAAATTTGTAACAATTTGATAAATCAGACTGTCACAATAATGGAAGGTTATGCAGTTATCGATTTAGAGGCACTTGATCCATCAAAAATTGAGGACATATGCCTGTCTAAATTCATCGCATTGGTTTTACAGTATATTTCACAAAGGCAGAGGCCAATTAGAGGTAGTACTTCAAAATTGCTGTTGCAATACATTCGTACCATCCCATGCAAGACCTCCCTTACTGCTGCTGGTTGCTACATTTGTCCAGCTCCTGGGTCTAAGGGTACCAAAGCTCTGATATGCTGCTCTGTTCATGGTCCTCTGCCTTCAAAGGCAGAATTATTTCAAGCACACTCTAGTGAAGAGCAGAAGCATTGTTTTTCAAATGAGTTGGAACAAATTATAGCAAATGGAAAATCATATTCTATTAACTTGGTCCCTAATGCATCCAAAGTGCAGTTTTTGAACATGGGGTCTGCGTCAGTTCTAGATGAAGCCCAGAGACTAGATATTGTCAGTGAGTCAACCTATAGAAACTCTATTTTATTGCAAAAGGCGGAAGTCAAACGTTTCAAGTCTAAAACTGATGAACTTGTGTCTGAATGTAAGGCAAAGCAGGAAGCTGAACATGTTGCTGCATTTCTGAGTGAACCACTTCTCCATGGGCAAACATGCTTCTTCATGAACCGGTTCATTATCTCATGGAAAGTAAGCAAAGAAATTTCTTGGAATGTTTTTCCCAGAGAAGCTTATTGTCTCTCATATTTGGGAAGGGAAAGTCAGCACAGTCATTGCTGTTGTATAAAGAGGCATGACATGGTAGCCAAGATTCGTCCCATGATTGATGCTGATTGGCTCTATCTTGCCGAGTTGTTGAAGTGGCCAAGTTCAGATAATTTTGAAGCGACAAAACTTCCTTTCTCTATAGAAGCAAATCCGTTAACCAAGAAGACAAAAATATGCTCAGATTATTCAAGGTTATCTGCAAAAGTAGCTCTCAAATCACTGAAATCTGTCCCTGCTGCAGCAAGAAGAAGCATTCCGGTCCTGGTCAATCATGATGGACAGCTACTTGGCATCCCAAGCATTGGCTTTAACCATTGCCCTTTCTTGATGACATCTGCCGTATTCAAGCCAAGAGTACCGCTTGGAGGGGGACACAGTTCCTTTCTTTAG
Protein:  
MARGLLLCSQTRTIAKLSFSLPNRKFRFKPHYHHLYYEHQNLPSTRVFCHCVSSQPSTVVDMAKYNEAFSRRMAMAGLKPHNHIALGVSGGPDSMALCVLAANWKTEGLYGSDKSGNCIDGLLAIIVDHGLRPESKDEASLVGHRVAEIGIRFEIARCDWSNGKPKQGHLQEAARDMRYKIFQDVCMQNQISVLLVAHHADDQAELFILRSSRDSGVLGLAGMAFTSQVFSSHTYFSNKDWKCHSILLVRPLLDFSKEDMYKICQGSNHDWVEDPTNRSSLFARNRIRMSLGNLSSCIFKSELQAVISACRKTRTYVDQICNNLINQTVTIMEGYAVIDLEALDPSKIEDICLSKFIALVLQYISQRQRPIRGSTSKLLLQYIRTIPCKTSLTAAGCYICPAPGSKGTKALICCSVHGPLPSKAELFQAHSSEEQKHCFSNELEQIIANGKSYSINLVPNASKVQFLNMGSASVLDEAQRLDIVSESTYRNSILLQKAEVKRFKSKTDELVSECKAKQEAEHVAAFLSEPLLHGQTCFFMNRFIISWKVSKEISWNVFPREAYCLSYLGRESQHSHCCCIKRHDMVAKIRPMIDADWLYLAELLKWPSSDNFEATKLPFSIEANPLTKKTKICSDYSRLSAKVALKSLKSVPAAARRSIPVLVNHDGQLLGIPSIGFNHCPFLMTSAVFKPRVPLGGGHSSFL